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We  examine  the  stochastic  properties  of  aggregate  macroeconomic  time  series  from  the 
standpoint  of  fractionally  integrated  models,  and  focus  on  the  persistence  of  economic 
shocks.  We  develop  a  simple  macroeconomic  model  that  exhibits  long-term  dependence, 
a  consequence  of  aggregation  in  the  presence  of  real  business  cycles.  We  derive  the  re- 
lation between  properties  of  fractionally  integrated  macroeconomic  time  series  and  those 
of  microeconomic  data,  and  discuss  how  fiscal  policy  may  alter  their  stochastic  behavior. 
To  implement  these  results  empirically,  we  employ  a  test  for  fractionally  integrated  time 
series  based  on  the  Hurst-Mandelbrot  rescaled  range.  This  test  is  robust  to  short-term 
dependence,  and  is  applied  to  quarterly  and  annual  real  GNP  to  determine  the  sources 
and  nature  of  long-term  dependence  in  the  business  cycle. 
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1.  Introduction. 

Questions  about  the  persistence  of  economic  shocks  currently  occupy  an  important 
place  in  macroeconomics.  Most  controversy  has  centered  on  whether  aggregate  time  series 
are  better  approximated  by  fluctuations  around  a  deterministic  trend,  or  by  a  random 
walk  plus  a  stationary  or  temporary  component.  The  empirical  results  from  these  studies 
are  mixed,  perhaps  because  the  time  series  representation  of  output  may  belong  to  a  much 
wider  clciss  of  processes  than  previously  considered.  In  particular,  earlier  studies  have 
ignored  the  class  of  fractionally  integrated  processes,  which  serves  as  a  compromise  between 
the  two  stochastic  models  usually  considered,  and  which  also  exhibits  an  interesting  type 
of  long-range  dependence.  This  new  approach  also  accords  well  with  the  classical  NBER 
business  cycle  program  exemplified  by  Wesley  Claire  Mitchell,  who  urged  examination  of 
trends  and  cycles  at  all  frequencies. 

Economic  life  does  not  proceed  smoothly:  there  are  good  times,  bad  times,  alternat- 
ing periods  of  prosperity  and  depression,  a  "rhythmical  fluctuation  in  activity."  Recurrent 
downturns  and  crises  take  place  roughly  every  3  to  5  years  and  thus  seem  part  of  a  non- 
periodic  cycle.  Studying  such  cycles  in  detail  has  been  the  main  activity  of  twentieth 
century  macroeconomics.  Even  so,  isolating  cycles  of  these  frequencies  has  been  diffi- 
cult because  the  data  evince  many  other  cycles  of  longer  and  shorter  duration.  Wesley 
Mitchell  (1927,  p.  463)  remarks  "Time  series  also  show  that  the  cyclical  fluctuations  of 
most  [not  all]  economic  processes  occur  in  combination  with  fluctuations  of  several  other 
sorts:  secular  trends,  primary  and  secondary,  seasonal  variations,  and  irregular  fluctua- 
tions." Properly  removing  these  other  influences  has  always  been  controversial.  No  less 
an  authority  than  Irving  Fisher  (1925)  considered  the  business  cycle  a  myth,  akin  to  runs 
of  luck  at  Monte  Carlo.  In  a  similar  vein,  Slutzky  (1937)  suggested  that  cycles  arose  from 
smoothing  procedures  used  to  create  the  data. 

A  similar  debate  is  now  taking  place.  The  standard  methods  of  removing  a  linear  or 
exponential  trend  assume  implicitly  that  business  cycles  are  fluctuations  around  a  trend. 
Other  work  [e.g.  Nelson  and  Plosser  (1982)]  challenges  this  and  posits  stochastic  trends 
similar  to  random  walks,  highlighting  the  distinction  between  temporary  and  permanent 
changes.  Since  empirically  the  cyclical  or  temporary  component  is  small  relative  to  the 
fluctuation  in  the  trend  component  [the  random  walk  part],  business  cycles  look  more 
like  Fisher's  myth.  This  is  important  for  forecasting  purposes  because  permanent  changes 
[as  in  the  case  of  a  random  walk]  today  have  a  large  effect  many  periods  later,  whereas 
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temporary  changes  [as  in  stationary  fluctuations  around  a  trend]  have  small  future  effects. 
The  large  random  walk  component  also  provides  evidence  against  some  theoretical  models 
of  aggregate  output.  Models  that  focus  on  monetary  or  aggregate  demand  disturbances 
as  a  source  of  transitory  fluctuations  cannot  explain  much  output  variation;  supply  side 
or  other  models  must  be  invoked  [e.g.  Nelson  and  Plosser  (1982),  Campbell  and  Mankiw 
(1987);. 

However,  the  recent  studies  posit  a  misleading  dichotomy.  In  stressing  trends  ver- 
sus random  walks,  they  overlook  earlier  work  by  Mitchell  [quoted  above],  Adelman,  and 
Kuznets,  who  have  stressed  correlations  in  the  data  intermediate  between  secular  trends 
and  transitory  fluctuations.  In  the  language  of  the  interwar  NBER,  they  are  missing 
Kondratiev,  Kuznets  and  Juglar  cycles.  In  particular,  in  the  language  of  modern  time 
series  analysis,  the  search  for  the  stochastic  properties  of  aggregate  variables  has  ignored 
the  clciss  of  fractionally  integrated  processes.  These  stochastic  processes,  midway  between 
white  noise  and  a  random  walk,  exhibit  a  long-range  dependence  no  finite  ARMA  model 
can  mimic,  yet  lack  the  permanent  effects  of  an  ARIMA  process.  They  show  promise 
of  explaining  the  lower  frequency  effects,  the  long  swings  emphasized  by  Kuznets  (1930), 
Adelman  (1965)  or  the  effects  which  persist  from  one  business  cycle  to  the  next.  Since  long 
memory  models  exhibit  dependence  without  the  permanent  effects  of  an  ARIMA  process, 
they  may  not  be  detected  by  standard  methods  of  fitting  Box-Jenkins  models.  This  calls 
for  a  more  direct  investigation  of  this  alternative  class  of  stochastic  processes. 

This  paper  examines  the  stochastic  properties  of  aggregate  output  from  the  stand- 
point of  fractionally  integrated  models.  We  introduce  this  type  of  process  in  Section  2, 
reviewing  its  main  properties,  advantages,  and  weaknesses.  Section  3  develops  a  simple 
macroeconomic  model  that  exhibits  long-term  dependence.  Section  4  employs  a  new  test 
for  fractional  integration  in  time  series  to  search  for  long-term  dependence  in  the  data. 
Though  related  to  a  test  of  Hurst's  and  Mandelbrot's,  it  is  robust  to  short-term  depen- 
dence. We  conclude  in  Section  5. 
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2.  Review  of  Fractional  Techniques  in  Statistics. 

A  random  walk  can  model  time  series  that  look  cyclic  but  non-periodic.  The  first 
differences  of  that  series  [or  in  continuous  time,  the  derivative]  should  then  be  white  noise. 
This  is  an  example  of  the  common  intuition  that  differencing  [differentiating]  a  time  se- 
ries makes  it  "rougher,"  whereas  summing  [integrating]  it  makes  it  "smoother."  Many 
macroeconomic  time  series  look  like  neither  a  random  walk  nor  white  noise,  suggesting 
that  some  compromise  or  hybrid  between  the  random  walk  and  its  integral  may  be  useful. 
Such  a  concept  has  been  given  content  through  the  development  of  the  fractional  calculus, 
i.e.,  differentiation  and  integration  to  non-integer  orders.  The  fractional  integral  of  order 
between  0  and  1  may  be  viewed  as  a  filter  that  smooths  white  noise  to  a  lesser  degree  than 
the  ordinary  integral;  it  yields  a  series  that  is  rougher  than  a  random  walk  but  smoother 
than  white  noise.  Granger  and  Joyeux  (1980)  and  Hosking  (1981)  develop  the  time  se- 
ries implications  of  fractional  differencing  in  discrete  time.  Since  several  recent  studies 
have  employed  fractionally  difference  time  series,  we  provide  only  a  brief  review  of  their 
properties  in  Sections  2.1  and  2.2. 


2.1.  Fractional  Differencing. 

Perhaps  the  most  intuitive  exposition  of  fractionally  differenced  time  series  is  via  their 
infinite-order  autoregressive  and  moving-average  representations.  Let  Xt  satisfy: 


(l-I)'^X,     =     et  (2.1) 


where  tf  is  white  noise,  d  is  the  degree  of  differencing,  and  L  denotes  the  lag  operator.  If 
d  =  0  then  Xt  is  white  noise,  whereas  Xt  is  a  random  walk  if  d  =  1.  However,  as  Granger 
and  Joyeux  (1980)  and  Hosking  (1981)  have  shown,  d  need  not  be  an  integer.  From  the 
binomial  theorem,  we  have  the  relation: 


'  The  idea  of  fractional  differentiation  ie  an  old  one,  dating  back  to  an  oblique  reference  by  Leibnit  in  1695,  but  the  subject 
lay  dormant  until  the  19th  century  when  Abel,  Liouville,  and  Riemann,  developed  it  more  fully  Extensive  applications  have 
only  arisen  in  this  century;  see,  for  example,  Oldham  and  Spanier  (1974).  Kolmogorov  (1940)  was  apparently  the  first  to  notice 
its  applications  in  probability  and  statistics. 

^See,  for  example,  Diebold  and  Rudebusch  (1989)  and  Sowell  (1989a, b)  for  further  details. 
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(1-^)'     =     E(-l)'(^)^'  (2-2) 


where  (^)  is  the  binomial  coefficient  extended  to  non-integer  values  of  d  in  the  natural  way. 
By  manipulating  (2.1)  mechanically,  Xt  also  ha.s  an  infinite-order  MA  representation.  In 
particular,  expressing  the  binomial  coefficients  in  terms  of  the  gamma  function,  we  have: 

X.     =     (.-L)-^<,     =     B(L)..         B,  =  pl|^.  (2.3) 

The  particular  time  series  properties  of  Xt  depend  intimately  on  the  value  of  the  differ- 
encing parameter  d.  For  example,  Granger  and  Joyeux  (1980)  and  Hosking  (1981)  show 
that  when  d  is  less  than  ^,  Xt  is  stationary;  when  d  is  greater  than  —  j,  Xt  is  invertible. 
Although  the  specification  in  (2.1)  is  a  fractional  integral  of  pure  white  noise,  the  extension 
to  fractional  ARIMA  models  is  clear. 

The  AR  and  MA  representations  of  fractionally  differenced  time  series  have  many 
applications,  and  illustrate  the  central  properties  of  fractional  processes,  particularly  long- 
term  dependence.  The  MA  coefficients  Bj^  tell  the  effect  of  a  shock  k  periods  ahead,  and 
indicate  the  extent  to  which  current  levels  of  the  process  depend  on  past  values.  How  fast 
this  dependence  decays  furnishes  valuable  information  about  the  process.  Using  Stirling's 
approximation,  we  have: 

Bu     «     -— —  (2.4) 

for  large  k.  Comparing  this  with  the  decay  of  an  AR(l)  process  highlights  a  central 
feature  of  fractional  processes:  they  decay  hyperbolically,  at  rate  k  ,  rather  than  at  the 
exponential  rate  of  p  for  an  AR(l).  For  example,  compare  in  Figure  1  the  autocorrelation 
function  of  the  fractionally  differenced  series  (1  —  L)^^^'^^Xt  =  Cf  with  the  AR(l)  Xt  = 
0.9X(_i  +  £(.  Although  they  both  have  first-order  autocorrelations  of  0.90,  the  AR(l)'s 
autocorrelation  function  decays  much  more  rapidly.  Figure  2a  plots  the  impulse-response 
functions  of  these  two  processes.  At  lag  1  the  MA-coefficients  of  the  fractionally  differenced 


^See  Hosking  (1981)  for  further  details. 
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series  and  the  AR(l)  are  0.475  and  0.900  respectively;  at  lag  10  they  are  0.158  and  0.349, 
and  at  lag  100  they  are  0.048  and  0.000027.  The  persistence  of  the  fractionally  differenced 
series  is  apparent  at  the  longer  lags.  Alternatively,  v/e  may  ask  what  value  of  an  AR(l)'s 
autoregressive  parameter  will,  for  a  given  lag,  yield  the  same  impulse-response  as  the 
fractionally  differenced  series  (2.1).  This  value  is  simply  the  A;-th  root  of  Bf^  and  is  plotted 
in  Figure  2b  for  various  lags  when  d  —  0.475.  For  large  k,  this  autoregressive  parameter 
must  be  very  close  to  unity. 

These  representations  also  show  how  standard  econometric  methods  can  fail  to  detect 
fractional  processes,  necessitating  the  methods  of  Section  4.  Although  a  high  order  ARMA 
process  can  mimic  the  hyperbolic  decay  of  a  fractionally  differenced  series  in  finite  samples, 
the  large  number  of  parameters  required  would  give  the  estimation  a  poor  rating  from  the 
usual  Akaike  or  Schwartz  Criteria.  An  explicitly  fractional  process,  however,  captures  that 
pattern  with  a  single  parameter  d.  Granger  and  Joyeux  (1980)  and  Geweke  and  Porter- 
Hudak  (1983)  provide  empirical  support  for  this  by  showing  that  fractional  models  often 
out-predict  fitted  ARMA  models. 

The  lag  polynomial  B{L)  provides  a  metric  for  the  persistence  of  Xf.  Suppose  Xt 
represents  GNP,  which  falls  unexpectedly  this  year.  How  much  should  this  change  a 
forecast  of  future  GNP?  To  address  this  issue,  define  Cj^  as  the  coefficients  of  the  lag 
polynomial  C{L)  that  satisfies  the  relation  (1  —  L)Xt  =  C{L)et,  where  the  process  Xt 
is  given  by  (2.1).  One  measure  used  by  Campbell  and  Mankiw  (1987)  and  Diebold  and 
Rudebusch  (1989)  is: 


lim  B,,     =     J^C,,     =     C(l)  .  (2.5) 

k=0 


For  large  k,  the  value  of  Bj^  measures  the  response  of  Xt^k  ^o  ^"  innovation  at  time  t,  a 
natural  metric  for  persistence."*  From  (2.4),  it  is  immediate  that  for  0  <  <i  <  1,  C(l)  =  0, 
and  asymptotically  there  is  no  persistence  in  a  fractionally  differenced  series,  even  though 
the  autocorrelations  die  out  very  slowly.  This  holds  true  not  only  for  d  <  ^  (the  stationary 
case),  but  also  for  j  <  d  <  1,  when  the  process  is  nonstationary. 


*But  sec  Cochrane  (1988),  Quah  (1988),  and  Sowell  (1989c)  for  opposing  views. 

'There  has  been  some  confusion  in  the  hterature  on  this  point.  Geweke  and  Porter-Hudak  (1983)  argue  that  C(l)  >  0. 
They  correctly  point  out  that  Granger  and  Joyeux  (1980)  have  made  an  error,  but  then  incorrectly  claim  that  C(l)  ;=  l/r{d). 
If  our  equation  (2  4)  is  correct,  then  it  is  apparent  that  C(l)  =  0  (which  agrees  with  Granger  (1980)  and  Hosking  (1981)]. 
Therefore,  the  focus  of  the  conflict  lies  in  the  approximation  of  the  ratio  r(/c  +  d]/r{k  +  1)  for  large  k.  We  have  used  Stirling's 
approximation.   However,  a  more  elegant  derivation  follows  from  the  functional  analytic  definition  of  the  gamma  function  as 
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From  these  calculations,  it  is  apparent  that  the  long-range  dependence  of  fractional 
processes  relates  to  the  slow  decay  of  the  autocorrelations,  not  to  any  permanent  effect. 
This  distinction  is  important;  an  IMA(l,l)  can  have  small  but  positive  persistence,  but 
the  coefficients  will  never  mimic  the  slow  decay  of  a  fractional  process. 

The  long-range  dependence  of  fractionally  differenced  time  series  forces  us  to  modify 
some  conclusions  about  decomposing  time  series  into  "permanent"  and  "temporary"  com- 
ponents. Although  Beveridge  and  Nelson  (1981)  show  that  non-stationary  time  series  may 
always  be  expressed  as  the  sum  of  a  random  walk  and  a  stationary  process,  the  stationary 
component  may  exhibit  long-range  dependence.  This  suggests  that  the  temporary  compo- 
nent of  the  business  cycle  may  be  transitory  only  in  the  mathematical  sense  and  is,  for  all 
practical  purposes,  closer  to  what  we  think  of  as  a  long  non-periodic  cycle. 

The  presence  of  fractional  differencing  has  yet  another  implication  for  the  Beveridge- 
Nelson  (BN)  approach.  Nelson  and  Plosser  (1982)  argue  that  deterministic  detrending 
may  overstate  the  importance  of  the  business  cycle  if  secular  movements  follow  a  random 
walk.  However,  if  the  cylical  component  is  an  AR(l)  with  a  nearly  unit  root,  McCallum 
(1986)  shows  that  the  BN  decomposition  will  assign  too  much  variance  to  the  permanent 
or  trend  part  and  understate  the  contribution  of  the  temporary  component.^  But  suppose 
the  temporary  component  follows  the  fractionally  differenced  time  series  (2.1).  A  decom- 
position in  the  spirit  of  BN,  i.e.,  differencing  the  series  the  appropriate  [fractional]  number 
of  times,  leaves  no  temporary  component.  However,  taking  first-differences  of  the  series  as 
Plosser  and  Schwert  (1978)  suggest  will  obviously  overstate  the  importance  of  the  cyclical 
component.  For  example,  if  we  difference  (2.1)  once,  instead  of  the  appropriate  fractional 
order,  the  temporary  component  becomes  (1  —  L)^~  e^.  Differencing  by  the  appropriate 
(fractional)  order  leaves  no  temporary  component.  Therefore,  the  presence  of  fractional 
noise  may  reverse  McCallum's  result. 


the  Bolution  to  the  following  recursive  relation  [see,  for  example,  lyanaga  and  Kawada  (1980  Section  179.A)): 

r(i+l)     =     xT{x) 


and  the  conditions; 


r(i)    =     1  lin,    ii^^     =     1 

n  —  ao    n'Tln) 


That  an  AR(1)  with  coefficient  0.99  is  in  any  sense  "temporary"  is  arguable. 

2.7  -  6  -  8.89 


2.2.  Spectral  Representation. 

The  spectrum,  or  spectral  density  [denoted  /(w)]  of  a  time  series  specifies  the  contri- 
bution each  frequency  makes  to  the  total  variance.  Granger  (1966)  and  Adelman  (1965) 
have  pointed  out  that  most  aggregate  economic  time  series  have  a  "typical  spectral  shape" 
where  the  spectrum  increases  dramatically  as  the  frequency  approaches  zero  [/(cj)  — >  oo 
as  w  — >  0].  Most  of  the  power  or  variance  seems  concentrated  at  low  frequencies,  or  long 
periods.  However,  pre-whitening  or  differencing  the  data  often  leads  to  "over-differencing" 
or  "zapping  out  the  low  frequency  component",  and  often  replaces  the  peak  by  a  dip  at  0. 
Fractional  differencing  yields  an  intermediate  result.  The  spectra  of  fractional  processes 
exhibit  peaks  at  0  [unlike  the  flat  spectrum  of  an  ARM  A  process],  but  ones  not  so  sharp  as 
the  random  walk's.  A  fractional  series  hais  a  spectrum  richer  in  low  frequency  terms,  and 
more  persistence.  We  illustrate  this  by  calculating  the  spectrum  of  fractionally  integrated 
white  noise,  and  also  present  several  formulas  needed  later  on. 

Given  Xt  =  (1  —  L)~  if,  the  series  is  clearly  the  output  of  a  linear  system  with  a  white 
noise  input,  so  that  the  spectrum  of  Xt  is: 


a 


2 


^(^)     =     T. ^dT         ^^"^         2  =  e-*",      o'  =  E{ei].  (2.6) 

1  —  2r     ^TT 


The  identity  |l  —  zf  =  2[l  —  cos(t<;)]  implies  that  for  small  u)  we  have: 


fiu)     =     cu;-2^  c^^.  (2.7) 


This  approximation  encompasses  the  two  extremes  of  a  white  noise  [or  a  finite  ARMA 
process]  and  a  random  walk.  For  white  noise,  d  =  0,  and  f{u>)  =  c,  while  for  a  random 
walk,  d  =  1  and  the  spectrum  is  inversely  proportional  to  o;'^.  A  class  of  processes  of 
current  interest  in  the  statistical  physics  literature,  called  l//  noise,  matches  fractionally 
integrated  noise  with  d  =  I. 


''See  Chatfield  (1984,  Chapters  6  and  9). 
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3.  A  Simple  Macroeconomic  Model  with  Long-Term  Dependence. 

Over  half  a  century  ago,  Wesley  Claire  Mitchell  (1927  p. 230)  wrote  that  "We  stand 
to  learn  more  about  economic  oscillations  at  large  and  about  business  cycles  in  particular, 
if  we  approach  the  problem  of  trends  as  theorists,  than  if  we  confine  ourselves  to  strictly 
empirical  work."  Indeed,  gaining  insights  beyond  stylized  facts  requires  guidance  from 
theory.  Models  of  long-range  dependence  may  provide  organization  and  discipline  in  the 
construction  of  economic  paradigms  of  growth  and  business  cycles.  They  can  guide  future 
research  by  predicting  policy  effects,  postulating  underlying  causes,  and  suggesting  new 
ways  to  analyze  and  combine  data.  Ultimately,  examining  the  facts  serves  only  as  a  prelude. 
Economic  understanding  requires  more  than  a  consensus  on  the  Wold  representation  of 
GNP;  it  demands  a  falsifiable  model  based  on  the  tastes  and  technology  of  the  actual 
economy. 

Thus,  before  testing  for  long-range  dependence,  we  develop  a  simple  model  of  economic 
equilibrium  in  which  aggregate  output  exhibits  such  behavior.  It  presents  one  reason  that 
macroeconomic  data  might  show  the  particular  stochastic  structure  for  which  we  test.  It 
also  shows  that  models  can  restrict  the  fractional  differencing  properties  of  time  series,  so 
that  our  test  holds  promise  for  distinguishing  between  competing  theories.  Furthermore, 
the  majcimizing  model  presented  below  connects  long-rajige  dependence  to  central  economic 
concepts  of  productivity,  aggregation,  and  the  limits  of  the  representative  agent  paradigm. 


3.1.  A  Simple  Real  Model. 

One  plausible  mechanism  for  generating  long-range  dependence  in  output,  which  we 
will  mention  here  and  not  pursue,  is  that  production  shocks  themselves  follow  a  fractionally 
integrated  process.  This  explanation  for  persistence  follows  that  used  by  Kydland  and 
Prescott  (1982).  In  general,  such  an  approach  begs  the  question,  but  in  the  present  case 
evidence  from  geophysical  and  meteorological  records  suggests  that  many  economically 
important  shocks  have  long  run  correlation  properties.  Mandelbrot  and  Wallis  (1969), 
for  example,  find  long-range  dependence  in  rainfall,  riverflows,  earthquakes  and  weather 
[measured  by  tree  rings  and  sediment  deposits]. 

A  more  satisfactory  model  explains  the  time  series  properties  of  data  by  producing 
them  despite  white  noise  shocks.  This  section  develops  such  a  model  with  long-range 
dependence,  using  a  linear  quadratic  version  of  the  real  business  cycle  model  of  Long  and 
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Plosser  (1983)  and  aggregation  results  due  to  Granger  (1980).  In  our  multi-sector  model 
the  output  of  each  industry  (or  island)  will  follow  an  AR(l)  process.  Aggregate  output  with 
N  sectors  will  not  follow  an  AR(l)  but  rather  an  ARMA(A'',A'^-1).  This  makes  dynamics 
with  even  a  moderate  number  of  sectors  unmanageable.  Under  fairly  general  conditions, 
however,  a  simple  fractional  process  will  closely  approximate  the  true  ARMA  specification. 
Consider  a  model  economy  with  many  goods  and  a  representative  agent  who  chooses 
a  production  and  consumption  plan.  The  infinitely  lived  agent  inhabits  a  linear  quadratic 
version  of  the  real  business  cycle  model.  The  agent  has  a  lifetime  utility  function  of 
U  =  ^/9'u(C()  where  Ct  is  an  A^xl  vector  denoting  period-t  consumption  of  each  of  the 
A'^  goods  in  our  economy.  Each  period's  utility  function  u{Ct)  is  given  by: 


u{Ct)     =     C[i  -    \c[BCt  (3.1) 


where  l  is  an  A^xl  vector  of  ones.  In  anticipation  of  the  aggregation  considered  later,  we 
assume  B  to  be  diagonal  so  that  C[BCt  =  Yi  ^ti^ff  "^^^  agents  face  a  resource  constraint: 
total  output  Yt  may  be  either  consumed  or  saved,  thus: 


Ct   +   Sti     =     Yt  (3.2) 

where  the  z,  j-th  entry  S^^t  of  the  NxN  matrix  St  denotes  the  quantity  of  good  j  invested 
in  process  i  at  time  t,  and  it  is  assumed  that  any  good  Yjt  may  be  consumed  or  invested. 
Output  is  determined  by  the  random  linear  technology: 

Yt     =     ASt   +   et  (3.3) 

where  et  is  a  (vector)  random  production  shock  whose  value  is  realized  at  the  beginning  of 
period  t  +  1.  The  matrix  A  consists  of  the  input-output  parameters  a^,.  To  focus  on  long- 
term  dependence  we  restrict  A's  form.  Thus,  each  sector  uses  only  its  own  output  as  input, 
yielding  a  diagonal  A  matrix  and  allowing  us  to  simplify  notation  by  defining  Cj  =  a^^.  This 
might  occur,  for  example,  with  a  number  of  distinct  islands  producing  different  goods.  To 
further  simplify  the  problem,  all  commodities  are  perishable  and  capital  depreciates  at  a 
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rate  of  100  percent.  Since  the  state  of  the  economy  in  each  period  is  fully  specified  by  that 
period's  output  and  productivity  shock,  it  is  useful  to  denote  that  vector  Zt  =  [V/    e[]'. 

Subject  to  the  production  function  and  the  resource  constraints  (3.2)  and  (3.3),  the 
agent  maximizes  expected  lifetime  utility: 


{5,}      '     '     '  {St) 


Y,0'-\{Yt-Sti) 


"-    T  =  t 


(3.4) 


where  we  have  substituted  for  consumption  in  (3.4)  using  the  budget  equation  (3.2).  This 
maps  naturally  into  a  dynamic  programming  formulation,  with  a  value  function  V  [Zt]  and 
optimality  equation: 

V[Zt)     =     U^.^[u[Yt-Sti)   +   0E\V{Zt+^)\Zt]\  .  (3.5) 

\St]     K  ) 

With  quadratic  utility  and  linear  production,  it  is  straightforward  to  show  that  V[Zi)  is 
quadratic,  with  fixed  constants  given  by  the  matrix  Riccati  equation  that  results  from 
the  recursive  definition  of  the  value  function.  Given  the  value  function,  the  first  order 
conditions  of  the  optimality  equation  (3.5)  yield  the  chosen  quantities  of  consumption  and 
investment/savings  and  have  closed-form  solutions. 

The  simple  form  of  the  optimal  consumption  and  investment  decision  rules  comes  from 
the  quadratic  preferences  and  the  linear  production  function.  Two  qualitative  features  bear 
emphasis.  First,  higher  output  today  will  increase  both  current  consumption  and  current 
investment,  thus  increasing  future  output.  Even  with  100  percent  depreciation,  no  durable 
commodities,  and  i.i.d.  production  shocks,  the  time-to-build  feature  of  investment  induces 
serial  correlation.  Second,  the  optimal  choices  do  not  depend  on  the  uncertainty  present. 
This  certainty  equivalence  feature  is  clearly  an  artifact  of  the  linear-quadratic  combination. 

The  time  series  of  output  can  now  be  calculated  from  the  production  function  (3.1) 
and  the  optimal  consumption/investment  decision  rules.  Quantity  dynamics  then  come 
from  the  difference  equation: 


y,(+l     -     ^^-^ — ^y,t  +   K,   +  e,t+i  (3.6) 


*See  Sargent  (1987,  Chapter  1)  for  an  excellent  exposition. 
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or 


Yu^l       =      <^^Y^t    +    K,     +    £,(^1  (3.7) 

where  Pj  and  X,  are  fixed  constants.  The  key  qualitative  property  of  quantity  dynamics 
summarized  by  (3.7)  is  that  output  Y^t  follows  an  AR(l)  process.  Higher  output  today 
implies  higher  output  in  the  future.  That  effect  dies  off  at  a  rate  that  depends  on  the 
parameter  a^,  which  in  turn  depends  on  the  underlying  preferences  and  technology. 

The  simple  output  dynamics  for  a  single  industry  or  island  neither  mimics  business 
cycles  nor  exhibits  long-run  dependence.  However,  aggregate  output,  the  sum  across  all 
sectors,  will  show  such  dependence,  which  we  now  demonstrate  by  applying  the  aggre- 
gation results  of  Granger  (1980,1988).  It  is  well-known  that  the  sum  of  two  series,  Xt 
and  Yt,  each  AR(l)  with  independent  error,  is  an  ARMA(2,1)  process.  Simple  induction 
then  implies  that  the  sum  of  A'^  independent  AR(l)  processes  with  distinct  parameters 
has  an  ARMA(A^,A'^  —  1)  representation.  With  over  six  million  registered  businesses  in 
America  (CEA,  1988),  the  dynamics  can  be  incredibly  rich,  and  the  number  of  parame- 
ters unmanageably  huge.  The  common  response  to  this  problem  is  to  pretend  that  many 
different  firms  (islands)  have  the  same  AR(l)  representation  for  output,  which  reduces  the 
dimensions  of  the  aggregate  ARMA  process.  This  "canceling  of  roots"  requires  identical 
autoregressive  parameters.  An  alternative  approach  reduces  the  scope  of  the  problem  by 
showing  that  the  ARMA  process  approximates  a  fractionally  integrated  process,  and  thus 
summarizes  the  many  ARMA  parameters  in  a  parsimonious  manner.  Though  we  consider 
the  ca^e  of  independent  sectors,  dependence  is  easily  handled. 

Consider  the  case  of  A'^  sectors,  with  the  productivity  shock  for  each  serially  uncor- 
related  and  independent  across  islands.  Furthermore,  let  the  sectors  differ  according  to 
the  productivity  coefficient  a^.  This  implies  differences  in  Qj,  the  autoregressive  parameter 
for  sector  I's  output  Y^^.  One  of  our  key  results  is  that  under  some  distributional  as- 
sumptions on  the  ttj's  aggregate  output  Vj"  follows  a  fractionally  integrated  process,  where 
Y^°'  =  X^j_i  Vit  .  To  show  this,  we  approach  this  problem  from  the  frequency  domain  and 
apply  spectral  methods  which  often  simplify  problems  of  aggregation.^  Let  /(w)  denote 
the  spectrum  [spectral  density  function]  of  a  random  variable,  and  let  z  =  e~*'^.  From 
the  definition  of  the  spectrum  as  the  Fourier  transform  of  the  autocovariance  function,  the 


*See  Theil  (1954). 
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spectrum  of  yj^  is: 


a' 


^•(")  =  u^^^-  ''•'' 


Similarly,  independence  implies  that  /(w),  the  spectrum  of  Y^°',  is  the  sum  of  the  spectra 
of  the  individual  Fit's.  The  q:,'s  mezisure  an  industry's  average  output  for  given  input. 
This  attribute  of  the  production  function  can  be  thought  of  as  a  drawing  from  nature,  as 
can  the  variance  of  the  productivity  shocks  e^t  for  each  sector.  Thus,  it  makes  sense  to 
think  of  the  Oj's  as  independently  drawn  from  a  distribution  G[a)  and  the  Oj's  drawn  from 
F{a).  Provided  that  the  £,j  shocks  are  independent  of  the  distribution  of  Qj's,  the  spectral 
density  of  the  sum  can  be  written  as: 


/H     =     ^£[^2]  .   / --J-^dF(a)  (3.9) 

2n  J    \l  —  azr 


If  the  distribution  F{a)  is  discrete,  so  that  it  takes  on  m  (<  A'^)  values,  F^"  will  be  an 
ARMA(m,m  —  1)  process.  A  more  general  distribution  leads  to  a  process  no  finite  ARMA 
model  can  represent.  To  further  specify  the  process,  take  a  particular  distribution  for  F, 
in  this  case  a  variant  of  the  Beta  distribution.  In  particular,  let  a  be  distributed  as 
Beta(p,g),  which  yields  the  following  density  function  for  a: 


dF{a)     =     {  ^(^''^  ~      ~  (3.10) 

0  otherwise 


with  p  >  0  and  q  >  0.^^  Obtaining  the  Wold  representation  of  the  resulting  process 
requires  a  little  more  work.  As  a  first  step,  expand  the  spectrum  (3.10)  by  long  division. 
Substituting  this  expansion  and  the  Beta  distribution  (3.10)  into  the  expression  for  the 
spectrum  and  simplifying  [using  the  relation  z  +  z  =  2cos(a;)]  yields: 


'"  Granger  (1980)  conjectures  that  the  particular  distribution  is  not  essential  but  only  proves  the  result  for  Beta  distributions. 
"  For  a  discussion  of  the  variety  of  shapes  the  Beta  distribution  takes  as  p  and  q  vary,  see  Johnson  and  Kott  (1970) 
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/M     =     y^    ^2   +   2^a^os(Mj^(^c.2p-l(l-a2)9-2da.  (3.11) 


Then  the  coefficient  of  cos(A;u;)  is: 


/: 


2a^ 


c,2p+^-l(i  _  ^2)9-2^^  _  (3.12) 


Since  the  spectral  density  is  the  Fourier  transform  of  the  autocovariance  function,  (3.12)  is 
the  A:-th  autocovariance  of  Yf'^.  Furthermore,  the  integral  defines  a  Beta  function,  so  (3.12) 
simplifies  to  0{p  +  k/2,q  —  l)//?(p,  g).  Dividing  by  the  variance  gives  the  autocorrelation 
coefficients  which  reduce  to 


*'         r(p)      r(p +  1  +  0+1) 


which,  again  using  the  result  from  Stirling's  approximation  T{a  +  k)/T{b  +  A;)  «  k°'~",  is 
proportional  (for  large  lags)  to  k^~^.  Thus,  aggregate  output  Yf^  follows  a  fractionally 
integrated  process  of  order  d  —  1  —  f"  Furthermore,  as  an  approximation  for  long  lags, 
this  does  not  necessarily  rule  out  interesting  correlations  at  higher,  e.g.  business  cycle, 
frequencies.  Similarly,  co-movements  can  arise  as  the  fractionally  integrated  income  process 
may  induce  fractional  integration  in  other  observed  time  series.  This  has  arisen  from  a 
maximizing  model  given  tastes  and  technologies.-^ 

In  principle,  all  parameters  of  the  model  may  be  estimated,  from  the  distribution  of 
production  function  parameters  to  the  variance  of  output  shocks.  Though  to  our  knowledge 
no  one  has  explicitly  estimated  the  distribution  of  production  function  parameters,  many 
people  have  estimated  production  functions  across  industries. ^^  One  of  the  better  recent 
studies  disaggregates  to  45  industries.      For  our  purposes,  the  quantity  closest  to  a,  is  the 


''Two  additional  points  are  worth  emphasizing  First,  the  Beta  distribution  need  not  be  over  (0,1)  to  obtain  these  results, 
only  over  (a,l)    Second,  it  is  indeed  possible  to  vary  the  a.'s  so  that  a,  has  a  Beta  distribution. 

'^Leontief,  in  his  classic  study  (1976)  reports  own-industry  output  coefficients  for  10  sectors:  how  much  an  extra  unit  of  food 
will  increase  food  production    These  vary  from  0.06  (fuel)  to  1.24  (other  industries). 

'*  Jorgenaon,  GoUop  and  Fraumeni  (1987). 
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value-weighted  intermediate  product  factor  share.  Using  a  translog  production  function, 
this  gives  the  factor  share  of  inputs  coming  from  industries,  excluding  labor  and  capital. 
These  range  from  a  low  of  0.07  in  radio  and  TV  advertising  to  a  high  of  0.811  in  petroleum 
and  coal  products.  Thus,  even  a  small  amount  of  disaggregation  reveals  a  large  dispersion 
and  suggests  the  plausibility  and  significance  of  the  simple  model  presented  in  this  section. 

3.2.  Welfare  Implications. 

Taking  a  policy  perspective  raises  a  natural  question  about  the  fractional  properties 
of  national  income.  Does  long-term  dependence  have  welfare  implications?  Do  agents  care 
that  they  live  in  such  a  world? 

People  who  must  predict  output  or  forecast  sales  will  care  about  the  fractional  nature 
of  output,  but  fractional  processes  can  have  normative  implications  as  well.  Following 
Lucas  (1987),  this  section  estimates  the  welfare  costs  of  economic  instability  under  different 
regimes.  We  can  decide  if  people  care  whether  their  world  is  fractional.  For  concreteness, 
let  the  typical  household  consume  C(,  evaluating  this  via  a  utility  function: 


U 


.l-CT 


^      1  -c 


oo 
t=0 


0' 


-E\Cl 


l-ai 


(3.14) 


Also  assume: 


InCt     =     (l  +  A)^^^LSt 
fc=0 


(3.15) 


where  rjj  =  Ine^.  The  A  term  measures  compensation  for  variations  in  the  process  (^{L). 
With  Tji  normally  distributed  with  mean  0  and  variance  1,  the  compensating  fraction  A 
between  two  processes  4>  and  xJj  is: 


1  +  A 


exp 


oo 


Ln-o)^(4-4) 


k=0 


(3.16) 


Evaluating  this  using  a  realistic  (7  =  5,  again  comparing  an  AR(l)  with  p  =  0.9  and 
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fractional  process  of  order  1/4,  we  find  that  A  =  —0.99996  [this  number  looks  larger  than 
those  in  Lucas  because  the  process  is  in  logs  rather  than  in  levels]. ^^  For  comparison,  this 
is  the  difference  between  an  AR(l)  with  p  of  0.90  and  one  with  p  of  0.95.  This  calculation 
provides  a  rough  comparison  only.  When  feasible,  welfare  calculations  should  use  the 
model  generating  the  processes,  as  only  it  will  correctly  account  for  important  specifics, 
such  as  labor  supply  or  distortionary  taxation. 


'^  We  calculate  this  using  (2.4)  and  the  Hardy-Littlewood  approximation  for  the  resulting  Riemann  Zeta  Function,  following 
Titchmarsh,  1951,  sec    4  11 
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4.  R/S  Analysis  of  Real  Output, 

The  results  of  Section  3  show  that  simple  aggregation  may  be  one  source  of  long-term 
dependence  in  the  business  cycle.  In  this  section  we  employ  a  method  for  detecting  long 
memory  and  apply  it  to  real  GNP.  The  technique  is  based  on  a  simple  generalization  of  a 
statistic  first  proposed  by  the  English  hydrologist  Harold  Edwin  Hurst  (1951),  which  has 
subsequently  been  refined  by  Mandelbrot  (1972,  1975)  and  others.^  Our  generalization 
of  Mandelbrot's  statistic  [called  the  "rescaled  range"  or  "range  over  standard  deviation" 
or  R/S]  enables  us  to  distinguish  between  short  and  long  run  dependence,  in  a  sense  to  be 
made  precise  below. 

We  define  our  notions  of  short  and  long  memory  and  present  the  test  statistic  in  Section 
4.1.  In  Section  4.2  we  present  the  empirical  results  for  real  GNP  Section  4.3;  we  find  long- 
term  dependence  in  log-linearly  detrended  output,  but  considerably  less  dependence  in 
the  growth  rates.  To  interpret  these  results,  we  perform  several  Monte  Carlo  experiments 
under  two  null  and  two  alternative  hypotheses  and  report  these  results  in  Section  4.3. 


4.1.  The  Rescaled  Range  Statistic. 

To  develop  a  method  of  detecting  long  memory,  we  must  be  precise  about  the  dis- 
tinction between  long-term  and  short-term  statistical  dependence.  One  of  the  most  widely 
used  concepts  of  short-term  dependence  is  the  notion  of  "strong-mixing"  due  to  Rosenblatt 
(1956),  which  is  a  measure  of  the  decline  in  statistical  dependence  of  two  events  separated 
by  successively  longer  spans  of  time.  Heuristically,  a  time  series  is  strong-mixing  if  the 
maximal  dependence  between  any  two  events  becomes  trivial  as  more  time  elapses  between 
them.  By  controlling  the  rate  at  which  the  dependence  between  future  events  and  those  of 
the  distant  past  declines,  it  is  possible  to  extend  the  usual  laws  of  large  numbers  and  central 
limit  theorems  to  dependent  sequences  of  random  variables.  Such  mixing  conditions  have 
been  used  extensively  by  White  (1982),  White  and  Domowitz  (1984),  and  Phillips  (1987) 
for  example,  to  relax  the  assumptions  that  ensure  consistency  and  asymptotic  normality  of 
various  econometric  estimators.  We  adopt  this  notion  of  short-term  dependence  as  part  of 
our  null  hypothesis.  As  Phillips  (1987)  observes,  these  conditions  are  satisfied  by  a  great 
many  stochastic  processes,  including  all  Gaussian  finite-order  stationary  ARMA  models. 
Moreover,  the  inclusion  of  a  moment  condition  also  allows  for  heterogeneously  distributed 


'*See  Mandelbrot  and  Taqqu  (1979)  and  Mandelbrot  and  Wallis  (1968,  1969a-c). 
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sequences  [such  as  those  exhibiting  heteroscedasticity],  an  especially  important  extension 
in  view  of  the  non-stationarities  of  real  GNP. 

In  contract  to  the  "short  memory"  of  weakly  dependent  [i.e.,  strong-mixing]  processes, 
natural  phenomena  often  display  long-term  memory  in  the  form  of  non-periodic  cycles. 
This  has  lead  several  authors,  most  notably  Mandelbrot,  to  develop  stochastic  models 
that  exhibit  dependence  even  over  very  long  time  spans.  The  fractionally  integrated  time 
series  models  of  Mandelbrot  and  Van  Ness  (1968),  Granger  and  Joyeux  (1980),  and  Hosking 
(1981)  are  examples  of  these.  Operationally,  such  models  possess  autocorrelation  functions 
that  decay  at  much  slower  rates  than  those  of  weakly  dependent  processes,  and  violate 
the  conditions  of  strong-mixing.  To  detect  long-term  dependence  [also  called  "strong 
dependence"],  Mandelbrot  suggests  using  the  range  over  standard  deviation  (R/S)  statistic, 
also  called  the  "rescaled  range,"  which  was  developed  by  Hurst  (1951)  in  his  studies  of 
river  discharges.  The  R/S  statistic  is  the  range  of  partial  suras  of  deviations  of  a  time  series 
from  its  mean,  rescaled  by  its  standard  deviation.  In  several  seminal  papers,  Mandelbrot 
demonstrates  the  superiority  of  R/S  to  more  conventional  methods  of  determining  long-run 
dependence  [such  as  autocorrelation  analysis  and  spectral  analysis]. 

In  testing  for  long  memory  in  output,  we  employ  a  modification  of  the  R/S  statistic 
that  is  robust  to  weak  dependence.  In  Lo  (1989),  a  formal  sampling  theory  for  the  statistic 
is  obtained  by  deriving  its  limiting  distribution  analytically  using  a  functional  central  limit 
theorem.      We  use  this  statistic  and  its  asymptotic  distribution  for  inference  below. 

Let  Xt  denote  the  first-difference  of  log-GNP;  we  assume  that: 

Xt     =     ^  +   ^  (4.1) 

where  n  is  an  arbitrary  but  fixed  parameter.  Whether  or  not  Xt  exhibits  long-term  memory 
depends  on  the  properties  of  {ft}-  As  our  null  hypothesis  H,  we  assume  that  the  sequence 
of  disturbances  {e^}  satisfies  the  following  conditions: 


{A\)       E\et\   =   0  for  all  t. 

*'See  Mandelbrot  {1972,  1975),  Mandelbrot  and  Taqqu  (1979),  and  Mandelbrot  and  Wallis  (1968,  1969a-c). 

'*Thi8  Btatietic  is  asymptotically  equivalent  to  Mandelbrot's  under  independently  and  identically  distributed  observations, 
however  Lo  (1989)  shows  that  the  original  R/S  statistic  may  be  significantly  biased  toward  rejection  when  the  time  series  is 
short-term  dependent.  Although  aware  of  this  bias,  Mandelbrot  (1972,  1975)  did  not  correct  for  it  since  his  focus  was  on  the 
relation  of  the  R/S  statistic's  logarithm  to  the  logarithm  of  the  sample  site,  which  involves  no  statistical  inference;  such  a 
relation  clearly  is  unaffected  by  short-term  dependence 
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{A2)       suptE\\et\^]    <    oo  for  some /?  >  2. 


{A3)       a^    =    limn- 


-ni^U^if 


exists  and  a     >   0. 


{A4)       {et}  is  strong-mixing  with  mixing  coefficients  a;,  that  satisfy 


19 


°°      l_i 


k=l 

Condition  [Al)  is  standard.  Conditions  {A2)  through  {A4)  are  restrictions  on  the  maximal 
degree  of  dependence  and  heterogeneity  allowable  while  still  permitting  some  form  of  the 
law  of  large  nimibers  and  the  [functional]  central  limit  theorem  to  obtain.  Note  that 
we  have  not  assumed  stationarity.  Although  condition  (.42)  rules  out  infinite  variance 
marginal  distributions  of  et  such  as  those  in  the  stable  family  with  characteristic  exponent 
less  than  2,  the  disturbances  may  still  exhibit  leptokurtosis  via  time-varying  conditional 
moments  [e.g.  conditional  heteroscedasticity].  Moreover,  since  there  is  a  trade-off  between 
conditions  {A2)  and  {A4),  the  uniform  bound  on  the  moments  may  be  relaxed  if  the  mixing 
coefficients  decline  faster  than  {A4)  requires.  For  example,  if  we  require  to  et  have  finite 
absolute  moments  of  all  orders  [corresponding  to  /?  — >  oo],  then  a^  must  decline  feister 
than  l/k.  However,  if  we  restrict  et  to  have  finite  moments  only  up  to  order  4,  then  aj^ 
must  decline  faster  than  l/k"^. 

Conditions  {Al)  —  {A4)  are  satisfied  by  many  of  the  recently  proposed  stochastic 
models  of  persistence,  such  as  the  stationary  AR(l)  with  a  near-unit  root.  Although  the 
distinction  between  dependence  in  the  short  versus  long  runs  may  appear  to  be  a  matter 
of  degree,  strongly  dependent  processes  behave  so  diff"erently  from  weakly  dependent  time 
series  that  our  dichotomy  seems  most  natural.  For  example,  the  spectral  densities  of 
strongly  dependent  processes  are  either  unbounded  or  zero  at  frequency  zero.  Their  partial 
sums  do  not  converge  in  distribution  at  the  same  rate  as  weakly  dependent  series.  And 
graphically,  their  behavior  is  marked  by  cyclic  patterns  of  all  kinds,  some  that  are  virtually 
indistinguishable  from  trends. '^^ 

Consider  a  sample  Xi,  X2,  ...,Xn  and  let  Xn  denote  the  sample  mean  ^  ^  X,.  Then 
the  modified  re-scaled  range  statistic,  which  we  shall  call  Qn,  is  given  by: 

'*For  the  precise  definition  of  strong-mixing,  and  for  further  details,  see  Rosenblatt  (1956),  White  (1984),  and  the  papers  in 
Eberlein  and  Taqqu  (1986). 

'°See  Herrndorf  (1985).  Note  that  one  of  Mandelbrot's  (1972)  arguments  in  favor  of  R/S  analysis  is  that  finite  second 
moments  are  not  required.  This  is  indeed  the  case  if  we  are  interested  only  in  the  almost  sure  convergence  of  the  statistic. 
However,  since  we  wish  to  derive  its  limiting  distribution  for  purposes  of  inference,  a  stronger  moment  condition  is  needed. 

^'See  Mandelbrot  (1972)  for  further  details 

2.7  -  18  -  8.89 


Qn 


^n{g) 


-    -    ;=1  -    -    3=1 


(4.2) 


where 


lig)      =      ^E(^;-^n)'    +    ^E'^;(9){      i:    (X.-^n)(X._,-X,)   I    (4.3) 


-      ^2    +    2^u;j(9)7y 


w,(g)    =    1 


9+1 


q  <  n. 


and  a^  and  ^.  are  the  usual  sample  variance  and  autocovariance  estimators  of  X.  Qn  is  the 
range  of  partial  sums  of  deviations  of  Xj  from  its  mean,  normalized  by  an  estimator  of  the 
partial  sum's  standard  deviation  divided  by  n.  The  estimator  ^,1(9)  involves  not  only  sums 
of  squared  deviations  of  X.,  but  also  its  weighted  autocovariances  up  to  lag  q;  the  weights 
u)j{q)  are  those  suggested  by  Newey  and  West  (1987),  and  yields  an  estimator  o^iq)-  that 
is  always  positive.  Theorem  4.2  of  Phillips  (1987)  demonstrates  the  consistency  o(  an{q) 
under  the  following  conditions: 


{A2')      supt  E\\et\'^^]    <    00  for  some /?>  2. 

{A5)       As  n  increases  without  bound,  q  also  increases  without  bound  such  that  q  ~ 
o(nV4). 

The  choice  of  the  truncation  lag  g  is  a  delicate  matter.  Although  q  must  increase  with 
[but  at  a  slower  rate  than]  the  sample  size,  Monte  Carlo  evidence  suggests  that  when  q 
becomes  large  relative  to  the  number  of  observations,  asymptotic  approximations  may  fail 
dramatically.  However,  q  cannot  be  chosen  too  small  otherwise  the  effects  of  higher-order 
autocorrelations  may  not  be  captured.  The  choice  of  q  is  clearly  an  empirical  issue  and 
must  therefore  be  chosen  with  some  consideration  of  the  data  at  hand. 


d„(9)  ie  also  an  estimator  of  the  spectral  density  function  of  Xi  at  frequency  lero,  using  a  Bartlett  window. 
^^  See,  for  example,  Lo  sind  MacKinlay  (1989) 
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If  the  observations  are  independently  and  identically  distributed  with  variance  o^,  our 
normalization  by  dn{^)  is  asymptotically  equivalent  to  normalizing  by  the  usual  standard 
deviation  estimator  Sn  =  [^  IZi(-^j  ~  -X'n)  p  •  The  resulting  statistic,  which  we  call  Qn, 
is  precisely  the  one  proposed  by  Hurst  (1951)  and  Mandelbrot  (1972): 


Qr 


Max 
l</t< 


(4.4) 


To  perform  statistical  inference  with  the  standardized  re-scaled  range  F^  =  Qn/y/^, 
we  require  its  distribution.  Although  its  finite-sample  distribution  is  not  apparent,  a  large- 
sample  approximation  has  been  derived  in  Lo  (1989)  under  assumptions  {Al),  (^2'),  and 

(A3) (-45).  In  particular,  the  limiting  distribution  of  V^,  which  corresponds  to  a  random 

variable  we  call  V ,  has  the  following  c.d.f.  and  p.d.f.: 


oo 


Fv{v)      =     1    +    2^(l-4fc2v2)e-2(H^ 


Jb=l 


k=l 


(4.5) 
(4.6) 


Using  Fy ,  critical  values  may  readily  be  calculated  for  tests  of  any  significance  level.  The 
most  commonly  used  values  are  reported  in  Table  1.  The  moments  of  V  are  also  easily 
computed  using  fy\  it  is  straightforward  to  show  that  E\V]  =  y/^  and  ^^[V^]  =  ^,  thus 
the  mean  and  standard  deviation  of  V  are  approximately  1.25  and  0.27  respectively.  The 
distribution  and  density  functions  are  plotted  in  Figure  3.  Observe  that  the  distribution 
is  positively  skewed  and  most  of  its  mass  falls  between  |  and  2. 

Although  Fy  completely  characterizes  the  rescaled  range  statistic  under  the  null  hy- 
pothesis of  short-range  dependence,  its  behavior  under  the  alternative  of  long-range  de- 
pendence is  considerably  different.  Lo  (1989)  shows  that  for  the  fractionally  differenced 
alternative  (2.1),  the  normalized  rescaled  range  V^  diverges  to  infinity  in  probability  when 
d  G  (0,2)  and  converges  to  zero  in  probability  when  d  G  (— j,0).  Therefore,  at  least 
against  (2.1),  a  test  for  long-term  memory  based  on  Vn  is  consistent. 

'*In  fact,  Lo  (1989)  shows  that  such  a  test  is  consistent  against  a  considerably  more  general  class  of  alternatives. 
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4.2.  Empirical  Results  for  Real  Output. 

We  apply  our  test  to  two  time  series  of  real  output:  quarterly  postwar  real  GNP  from 
1947:1  to  1987:4,  and  the  annual  Friedman  and  Schwartz  (1982)  series  from  1869  to  1972. 
These  results  are  reported  in  Table  2.  The  first  row  of  numerical  entries  are  estimates  of 
the  classical  rescaled  range  V^  which  is  not  robust  to  short-term  dependence.  The  next 
eight  rows  are  estimates  of  the  modified  rescaled  range  Vn{q)  for  values  of  q  from  1  to  8. 
Recall  that  q  is  the  truncation  lag  of  the  estimator  of  the  spectral  density  at  frequency 
zero.  Reported  in  parentheses  below  the  entries  for  Vn{q)  are  estimates  of  the  percentage 
bieis  of  the  statistic  Vn,  and  is  computed  as  100  •  [(^n/^n(9))  ~  l]- 

The  first  column  of  nxmierical  entries  in  Table  2  indicate  that  the  null  hypothesis  of 
short-term  dependence  for  the  first-diff"erence  of  log-GNP  cannot  be  rejected  for  any  value 
of  q.  The  classical  rescaled  range  statistic  also  supports  the  null  hypothesis.  The  results 
for  the  Friedman  and  Schwartz  series  are  similar.  When  we  log-linearly  detrend  real  GNP, 
the  results  diff"er  considerably.  The  third  column  of  numerical  entries  in  Table  2  show 
that  short-term  dependence  may  be  rejected  for  log-linearly  detrended  quarterly  output 
with  values  of  q  from  1  to  4.  That  the  rejections  are  weaker  for  larger  q  is  not  surprising 
since  additional  noise  arises  from  estimating  higher-order  autocorrelations.  When  values 
of  q  beyond  4  are  used,  we  no  longer  reject  the  null  hypothesis  at  the  5  percent  level  of 
significance.  Finally,  using  the  Friedman  and  Schwartz  time  series,  we  only  reject  with  the 
classical  rescaled  range  and  with  Vn{l). 

The  tests  on  the  diff"erenced  series  ought  to  be  particularly  striking,  especially  since 
the  classical  version  [Vn(0)]  rejects  too  often,  on  evidence  of  merely  short  term  dependence. 
Even  over-differencing  is  not  a  problem:  if  the  series  has  a  fractional  component,  so  will 
its  first  diff"erence,  and  the  test  ought  to  pick  that  up  as  well. 

The  detrended  series  show  more  evidence  of  dependence,  but  even  there  the  signifi- 
cance disappears  when  only  a  year  of  lags  are  included.  Partly  due  to  decreasing  size  and 
power,  at  these  lags  the  test  may  not  adequately  control  for  short  term  dependence.  In 
addition,  finding  more  significant  autocorrelations  in  log-linearly  detrended  GNP  is  pre- 
cisely the  spurious  periodicity  suggested  by  Nelson  and  Kang  (1981).  Their  results  are 
particularly  cogent  since  they  show  that  inappropriate  detrending  introduces  a  great  deal 
of  power  into  low  frequencies. 

Taken  as  a  whole  then,  our  results  accept  the  null  hypothesis  of  short  term  depen- 
dence.   Equivalently,  they  reject  the  alternative  of  long  term  dependence;  GNP  has  no 
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fractional  component.  This  finding  deserves  some  notice  simply  because  of  the  many  pre- 
vious attempts  (direct  or  indirect)  to  resolve  the  issue.  From  the  path-breaking  early 
work  of  Mandelbrot  and  Wallis  (1969b)  and  Adelman  (1963)  to  the  sophisticated  recent 
estimates  of  Diebold  and  Rudebusch  (1989),  econometricians  have  not  conclusively  estab- 
lished the  existence  or  non-existence  of  long  term  dependence.  In  particular  the  estimates 
of  Diebold  and  Rudebusch  have  rather  large  standard  errors,  leaving  them  unable  to  reach 
precise  conclusions  about  the  existence  of  fractional  differencing.  Our  test  statistic  has  a 
tighter  distribution,  and  in  conjunction  with  the  size  and  power  results  reported  below, 
allow  more  definite  conclusions.  Though  by  no  means  the  final  work  on  the  subject,  our 
tests  present  strong  evidence  against  long  term  dependence. 

Our  main  concern  is  how  these  results  improve  our  ability  to  model  the  aggregate 
economy,  whether  in  the  purely  statistical  sense  of  estimating  the  right  stochcistic  process 
or  in  the  theoretical  sense  of  developing  the  appropriate  equilibrium  structure.  From  either 
perspective,  finding  only  short  term  dependence  simplifies  some  matters  while  complicating 
others. 

From  the  standpoint  of  obtaining  a  correct  statistical  representation,  our  results  sub- 
stantially circumscribe  the  class  of  stochastic  processes  needed  to  characterize  GNP.  This 
simplifies  the  problem,  whether  the  goal  be  to  measure  the  persistence  of  economic  shocks, 
assess  the  welfare  costs  of  economic  instability,  or  derive  the  correct  theory.  The  standard 
models  of  trend  (perhaps  stochastic)  and  finite  ARMA  adequately  capture  the  stochas- 
tic structure  of  GNP.  On  the  other  hand,  rejecting  fractional  processes  removes  a  simple 
explanation  to  the  problems  plaguing  this  area.  The  intriguing  possibility  that  the  con- 
tending sides  had  each  misread  the  true,  fractional,  nature  of  GNP  unfortunately  is  false, 
and  the  disagreement  must  have  another,  perhaps  deeper  source. 

These  results  also  confirm  the  unit  root  findings  of  Campbell  and  Mankiw  (1987), 
Nelson  and  Plosser  (1982),  Perron  and  Phillips  (1987)  and  Stock  and  Watson  (1986).  We 
cannot  reject  the  null  hypothesis  of  a  diff"erence  stationary  short  term  dependent  process. 
As  mentioned  before,  the  significant  autocorrelations  appearing  in  detrended  GNP  indicate 
the  spurious  periodicity  suggested  by  Nelson  and  Kang  (1981).  Moreover,  the  trend  plus 
stationary  noise  model  of  GNP  is  not  contained  in  our  null  hypothesis,  hence  our  failure 
to  reject  the  null  hypothesis  is  also  consistent  with  the  unit  root  model. ■^^  To  see  this, 
observe  that  if  log-GNP  yt  were  trend  stationary,  i.e.: 


*    Of  courBe,  this  may  be  the  result  of  low  power  against  stationary  but  near-integrated  processes,  and  must  be  addressed  by 
Monte  Carlo  experiments 
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yt     =     Q   +   /3<    +    rjt  (4.7) 

where  rjt  is  stationary  white  noise,  then  its  first-difference  Xt  is  simply  Xt  =  0  +  et  where 
€(  =  T]t  -  T]i^i.  But  this  innovations  process  violates  our  assumption  (A3)  and  is  therefore 
not  contained  in  our  null  hypothesis. 

From  the  perspective  of  a  theorist,  the  tests  in  Table  2  provide  more  information  than 
usual  because  they  also  serve  to  reject  a  particular  model.  Recall  the  section's  main  point; 
moving  to  a  multi-sector  model  can  produce  qualitatively  different  output  dynamics.  Our 
test  provides  no  evidence  for  those  dynamics.  The  American  economy's  multi-sector  nature 
does  not  qualitatively  affect  its  output  dynamics.  For  many  questions,  single  sector  models 
provide  a  rich  enough  environment.  Once  again  this  represents  a  simplification;  it  allows 
us  to  avoid  a  broad  and  difficult  class  of  models. 

As  before,  the  results  also  complicate  the  theoretical  picture.  Because  the  American 
economy  has  many  sectors,  rejecting  the  predictions  of  a  multi-sector  model  presents  a  po- 
tential puzzle.  Currently  the  strong  assumptions  needed  to  produce  long  term  dependence 
take  the  edge  off  any  puzzle,  but  these  are  only  sufficient  conditions.  Necessary  conditions, 
or  broader  sufficient  conditions,  are  unknown.  We  don't  know  enough  about  what  pro- 
duces fractional  differencing  or  how  closely  actual  conditions  match  those  requirements  to 
worry  about  not  finding  it.  If  broader  conditions  produce  long  term  dependence  -  and  if 
actual  sector  dynamics  meet  those  conditions  -  then  a  genuine  puzzle  arises. 

Related  research  may  also  turn  up  puzzles,  making  the  lack  of  fractional  processes  in 
GNP  anomalous.  For  example  Haubrich  (1989)  finds  fractional  processes  in  consumption 
across  several  countries.  The  output  results  here  therefore  rule  out  several  natural  expla- 
nations, such  as  a  constant  marginal  propensity  to  consume  out  of  fractionally  differenced 
income. 

From  another  perspective,  our  results  impose  a  discipline  on  future  modelling  at- 
tempts. As  Singleton  (1988)  points  out,  dynamic  macro  models  make  predictions  over 
many  frequencies:  seasonal,  cyclical,  and  longer.  Their  predictions  must  now  conform  to 
the  facts  on  long  term  dependence.  Future  dynamic  models  -  multi-sector  or  otherwise 
-  must  respect  this  constraint,  or  bear  the  defect  of  having  already  been  rejected.  How 
restrictive  this  will  be  is  unknown. 

To  conclude  that  the  data  support  the  null  hypothesis  because  our  statistic  fails  to 
reject  it  is,  of  course,  premature  since  the  size  and  power  of  our  test  in  finite  samples  is 
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yet  to  be  determined.    We  perform  illustrative  Monte  Carlo  experiments  and  report  the 
results  in  the  next  section. 


4.3.  The  Size  and  Power  of  the  Test. 

To  evaluate  the  size  and  power  of  our  test  in  finite  samples,  we  perform  several  illus- 
trative Monte  Carlo  experiments  for  a  sample  size  of  163  observations,  corresponding  to 
the  number  of  quarterly  observations  of  real  GNP  growth  from  1947:2  to  1987:4.'^^  We 
simulate  two  null  hypotheses:  independently  and  identically  distributed  increments,  and 
increments  that  follow  an  ARMA(2,2)  process.  Under  the  i.i.d.  null  hypothesis,  we  fix 
the  mean  and  standard  deviation  of  our  random  deviates  to  match  the  sample  mean  and 
standard  deviation  of  our  quarterly  data  set:  7.9775  x  10~^  and  1.0937  x  10~^  respectively. 
To  choose  parameter  values  for  the  ARMA(2,2)  simulation,  we  estimate  the  model: 

{\  -  4>iL  -  4>2L^)yt     =     ^l   +   {\  +  eiL  +  d2L^)it         tt~WN[0,ol)  (4.8) 

using  nonlinear  least  squares.  The  parameter  estimates  are  [with  standard  errors  in  paren- 
theses]: 

4>i      =       0.5837  Oi      =   -  0.2825 

(0.1949)  (0.1736) 

4>2      =^    -  0.4844  $2      =       0.6518 

(0.1623)  (0.1162) 

ti       =       0.0072 
(0.0016) 

a1      =       0.0102 

Table  3  reports  the  results  of  both  null  simulations. 

It  is  apparent  from  the  "I.I.D.  Null"  Panel  of  Table  3  that  the  5  percent  test  based  on 
the  classical  rescaled  range  rejects  too  frequently.   The  5  percent  test  using  the  modified 


^*  All  simulations  were  performed  in  double  precision  on  a  VAX  8700  using  the  IMSL  10.0  random  number  generator  DRNNOA; 
each  experiment  was  comprised  of  10,000  replications. 

2.7  -  24  -  8.89 


rescaled  range  with  q  =  3  rejects  4.6  percent  of  the  time,  closer  to  its  nominal  size.  As  the 
number  of  lags  increcises  to  8,  the  test  becomes  more  conservative.  Under  the  ARMA(2,2) 
null  hypothesis,  it  is  apparent  that  modifying  the  rescaled  range  by  the  spectral  density 
estimator  d^{q)  is  critical;  the  size  of  a  5  percent  test  based  on  the  classical  rescaled  range 
is  34  percent,  whereas  the  corresponding  size  using  the  modified  R/S  statistic  with  q  =  5 
is  4.8  percent.  As  before,  the  test  becomes  more  conservative  when  q  is  increased. 

Table  3  also  reports  the  size  of  tests  using  the  modified  rescaled  range  when  the 
lag  length  q  is  chosen  optimally  using  Andrews'  (1987)  procedure.  This  data-dependent 
procedure  entails  computing  the  first-order  autocorrelation  coefficient  p(l)  and  then  setting 
the  lag  length  to  be  the  integer- value  of  Mn,  where. 


A^n     ^     K^  &     =     77-^-  (4-9) 


(l-p2)2 


Under  the  i.i.d.  null,  Andrews'  formula  yields  a  5  percent  test  with  empirical  size  6.9 
percent;  under  the  ARMA(2,2)  alternative,  the  corresponding  size  is  4.1  percent.  Although 
significantly  different  from  the  nominal  value,  the  empirical  size  of  tests  based  on  Andrews' 
formula  may  not  be  economically  important.  In  addition  to  its  optimality  properties,  the 
procedure  has  the  advantage  of  eliminating  a  dimension  of  arbitrariness  in  performing  the 
test. 

Table  4  reports  power  simulations  under  two  fractionally  differenced  alternatives:  (1  — 
L)  £(  =  rji  where  d  =  1/3,-1/3.  Hosking  (1981)  has  shown  that  the  autocovariance 
function  7e(fc)  of  e^  is  given  by: 


T(l-2d)T{d  +  k)  2  ,      /     1     l^  ,        ^ 


Realizations  of  fractionally  differenced  time  series  [of  length  163]  are  simulated  by  pre- 
multiplying  vectors  of  independent  standard  normal  random  variates  by  the  Cholesky- 
factorization  of  the  163  x  163  covariance  matrix  whose  entries  are  given  by  (4.10).  To 
calibrate  the  simulations,  a"^  is  chosen  to  yield  unit  variance  €t's,  the  {e^}  series  is  then 
multiplied  by  the  sample  standard  deviation  of  real  GNP  growth  from  1947:1  to  1987:4, 


'^In  addition,  Andrews'  procedure  requires  weighting  the  autocovariances  by  1  -  j^  (j  =  1, . . . ,  [A/„])  in  contrast  to  Newey 
and  West's  (1987)  1  -  -^  [j  =  l,...,fl),  where  q  is  an  integer  and  Mn  need  not  be. 
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and  to  this  series  is  added  the  sample  mean  of  real  GNP  growth  over  the  same  sample 
period.  The  resulting  time  series  is  used  to  compute  the  power  of  the  rescaled  range;  Table 
4  reports  the  results. 

For  small  values  of  q,  tests  based  on  the  modified  rescaled  range  have  reasonable 
power  against  both  fractionally  differenced  alternatives.  For  example,  using  one  lag  the  5 
percent  test  against  the  d  =  1/3  alternative  has  58.7  percent  power;  against  the  d=  —1/3 
alternative  this  test  hcis  81.1  percent  power.  As  the  lag  length  is  increased,  the  test's 
power  declines.  Note  that  tests  baised  on  the  classical  rescaled  range  is  significantly  more 
powerful  than  those  using  the  modified  R/S  statistic.  This,  however,  is  of  little  value  when 
distinguishing  between  long-term  versus  short-term  dependence  since  the  test  using  the 
classical  statistic  also  has  power  against  some  stationary  finite-order  ARMA  processes. 
Finally,  note  that  tests  using  Andrews'  truncation  lag  formula  have  reasonable  power 
against  the  d  =  —1/3  alternative  but  are  considerably  weaker  against  the  more  relevant 
d  —  1/3  alternative. 

The  simulation  evidence  in  Tables  3  and  4  suggest  that  our  empirical  results  do  indeed 
support  the  short-term  dependence  of  GNP  with  a  unit  root.  Our  failure  to  reject  the 
null  hypothesis  does  not  seem  to  be  explicable  by  a  lack  of  power  against  long-memory 
alternatives.  Of  course,  our  simulations  were  illustrative  and  by  no  means  exhaustive; 
additional  Monte  Carlo  experiments  must  be  performed  before  a  full  assessment  of  the 
test's  size  and  power  is  complete.  Nevertheless  our  modest  simulations  indicate  that  there 
is  little  empirical  evidence  in  favor  of  long-term  memory  in  GNP  growth  rates.  Perhaps 
the  direct  estimation  of  long-memory  models  would  yield  stronger  results  and  is  currently 
being  investigated  by  several  authors.'^® 


5.  Conclusion. 

This  paper  hzs  suggested  a  new  approach  to  the  stochastic  structure  of  aggregate 
output.  Traditional  dissatisfaction  with  the  conventional  methods  -  from  observations 
about  the  typical  spectral  shape  of  economic  time  series,  to  the  discovery  of  cycles  at  all 
periods  -  calls  for  such  a  reformulation.  Indeed,  recent  controversy  over  deterministic 
versus  stochastic  trends  and  the  persistence  of  shocks  underscores  the  difficulties  even 
modern  methods  have  of  identifying  the  long  run  properties  of  the  data. 

^'See,  for  example,  Diebold  and  Rudebusch  (1989),  Sowell  (I989a,b),  and  Yajima  (1985,1988). 
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Fractionally  integrated  random  processes  provide  one  explicit  approach  to  the  prob- 
lem of  long-term  dependence;  naming  and  characterizing  this  aspect  is  the  first  step  in 
studying  the  problem  scientifically.  Controlling  for  its  presence  improves  our  ability  to  iso- 
late business  cycles  from  trends  and  to  Jissess  the  propriety  of  that  decomposition.  To  the 
extent  that  it  explains  output,  long-term  dependence  deserves  study  in  its  own  right.  Fur- 
thermore, Singleton  (1988)  has  recently  pointed  out  that  dynamic  macroeconomic  models 
often  link  inextricably  predictions  about  business  cycles,  trends,  and  seasonal  effects.  So 
too  linked  is  long-term  dependence:  a  fractionally  integrated  process  arises  quite  naturally 
in  a  dynamic  linear  model  via  aggregation.  This  model  not  only  predicts  the  existence  of 
fractional  noise,  but  also  suggests  the  character  of  its  parameters.  This  clciss  of  models 
leads  to  testable  restrictions  on  the  nature  of  long-term  dependence  in  aggregate  data,  and 
also  holds  the  promise  of  policy  evaluation. 

Advocating  a  new  class  of  stochastic  processes  would  be  a  fruitless  task  if  its  members 
were  intractable.  In  fact,  manipulating  such  processes  causes  few  problems.  We  con- 
structed an  optimizing  linear  dynamic  model  that  exhibits  fractionally  integrated  noise, 
and  provided  an  explicit  test  for  such  long-term  dependence.  Modifying  a  statistic  of  Hurst 
and  Mandelbrot  gives  us  a  statistic  robust  to  short-term  dependence,  and  this  modified 
R/S  statistic  possesses  a  well-defined  limiting  distribution  which  we  have  tabulated.  Illus- 
trative computer  simulations  indicate  that  this  test  has  power  against  at  least  two  specific 
alternative  hypotheses  of  long-memory. 

Two  main  conclusions  arise  from  the  empirical  work  and  Monte  Carlo  experiments. 
First,  the  evidence  does  not  support  long-term  dependence  in  GNP.  Rejections  of  the  short- 
term  dependence  null  hypothesis  occur  only  with  detrended  data,  and  is  consistent  with 
the  well-known  problem  of  spurious  periodicities  induced  by  log-linear  detrending.  Second, 
since  a  trend-stationary  model  is  not  contained  in  our  null  hypothesis,  our  failure  to  reject 
may  also  be  viewed  as  supporting  the  first-difference  stationary  model  of  GNP,  with  the 
additional  result  that  the  resulting  stationary  process  is  weakly  dependent  at  most.  This 
supports  and  extends  the  conclusion  of  Adelman  that,  at  least  within  the  confines  of  the 
available  data,  there  is  little  evidence  of  long-term  dependence  in  the  business  cycle. 
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Figure  1. 


Autocorrelation  functions  of  an  AR(l)  with  coefficient  0.90  [dashed  line]  and  a  fractionally 
differenced  series  Xt  =  [1  —  L)~  tt  with  differencing  parameter  d  =  0.475  [solid  line].  Al- 
though both  processes  have  a  first-order  autocorrelation  of  0.90,  the  fractionally  differenced 
process  decays  much  more  slowly. 
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Impulse  response  function  [solid  line]  of  the  fractionally  differenced  time  series  Xt  = 
(l  —  L)~  tt  for  differencing  parameter  d  —  0.475.  For  comparision,  the  impulse-response 
function  of  an  AR(l)  with  autoregressive  parameter  0.90  is  also  plotted  [dashed  lines]. 
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Values  of  an  AR(l)'s  autoregressive  parameter  required  to  generate  the  same  fc-th  order 
autocorrelation  as  the  fractionally  differenced  series  Xt  =  {I  —  L)~  et  for  differencing 
parameter  d  =  0.475  [solid  line].  Formally,  this  is  simply  the  fc-th  root  of  the  fractionally 
differenced  series'  impulse-response  function  [dashed  line].  For  large  k,  the  autoregressive 
parameter  must  be  very  close  to  unity. 


2.6.f2b 


1.89 


CRrt-Ej  g. 

»NrBE»    *     .C           A( 

•   :    '9f9      :  1" e'  rv 

o 

^/(v) 

ond 

fv(v) 

CNJ 

■ 

in 

' 

q 

^          / 

l\ 

■ 

\^ 

in 

o 

- 

1 

^ 

/  \ 

■ 

1 

/ 

\ 

o 

• 

/         / 

/      / 

\ 

- 

3 

o 

0. 

1 

2                           ; 

V 

Figure  3. 


Distribution  and  density  function  of  the  range  F  of  a  Brownian  bridge.  Dashed  curves  are 
the  normal  distribution  and  density  functions  with  mean  and  variance  equal  to  those  of  V . 
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Table  la.  Fractiles  of  the  Distribution  Fv{v). 


P{V  <  v) 

.005 

.025 

.050 

.100 

.200 

.300 

.400 

.500 

V 

0.721 

0.809 

0.861 

0.927 

1.018 

1.090 

1.157 

1.223 

p(r  <  v) 

.543 

.600 

.700 

.800 

.900 

.950 

.975 

.995 

V 

v/f 

1 

1.294 

1.374 

1.473 

1.620 

1.747 

1.862 

2.098 

Table  lb.  Symmetric  Confidence  Intervals  About  the  Mean. 


PiV^2- 

-1<V    <y 

/f  +  ^) 

7 

.001 

0.748 

.050 

0.519 

.100 

0.432 

.500 

0.185 
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Table  2. 


R 'S  analysis  of  real  GNP;  y^^^  indicates  log-linearly  detrended  quarterly  real  GNP  from  1947:1  to 
1987:4,  and  Ay*^  indicates  the  first-differences  of  the  logarithm  of  real  GNP.  y^^^  and  Ay^^  are 
defined  similairy  for  the  Friedman  and  Schwartz  series.  The  classical  rescaled  range  V„,  and  the 
modified  rescaled  range  Vn{q)  are  reported. 


Ay« 

Ay" 

Q 
VDet. 

VDet. 

K 

1.25 

1.00 

4.23* 

2.83* 

v^.(l) 

1.07 

0.94 

3.02* 

2.10* 

(%-Bias) 

(17.2) 

(6.6) 

(40.0) 

(35.2) 

V^n(2) 

0.97 

0.93 

2.49* 

1.79 

(%-Bias) 

(29.0) 

(7.3) 

(69.6) 

(58.5) 

^n(3) 

0,93 

0.95 

2.19* 

1.62 

(%-Bias) 

(34.6) 

(4.7) 

(93.5) 

(74.9) 

Vn(4) 

0.92 

1.00 

1.98* 

1.52 

(%-Bias) 

(36.3) 

(-0.1) 

(113.5) 

(86.8) 

^r.(5) 

0.92 

1.07 

1.83 

1.45 

(%-Bias) 

(36.1) 

(-6.4) 

(130.7) 

(95.7) 

^n(6) 

0.92 

1.10 

1.72 

1.40 

(%-Bias) 

(35.3) 

(9.3) 

(145.7) 

(102.7) 

^.(7) 

0.93 

1.12 

1.63 

1.36 

(%-Bias) 

(34.4) 

(-10.8) 

(159.0) 

(107.9) 

^n(8) 

0.94 

1.14 

1.56 

1.34 

(%-Bias) 

(33.2) 

(-12.7) 

(170.9) 

(111.5) 

'  Under  the  null  hypothesis  H  [conditions  (Al),  (A2'),  (A3)-(A5)  of  the  paper],  the  limiting  distribution  of  V„(<j)  is  the  range 
of  a  Brownian  bridge,  which  has  a  mean  of  v/t/2.  Fractiles  are  given  in  Table  1;  the  95  percent  confidence  interval  with  equal 
probabilities  in  both  tails  is  [0.809,  1.862].  Entries  in  the  %-Bia8  columns  are  computed  as  [{V„/V„(q))^/^  -  l]  •  100,  and  are 
estimates  of  the  bias  of  the  clsissical  R/S  statistic  in  the  presence  of  short-term  dependence.  Asterisks  indicate  significance  at 
the  5  percent  level. 
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Table  3. 

Finite  sample  distribution  of  the  modified  R/S  statistic  under  i.i.d  and  ARMA(2,2)  null  hypotheses  for  the  first-difference  of 
real  log-GNP  The  Monte  Carlo  experiments  under  the  two  null  hypotheses  are  independent  and  consist  of  10,000  replications 
each  Parameters  of  the  i.i.d.  simulations  were  chosen  to  match  the  sample  mean  and  variance  of  quarterly  real  GNP  growth 
rates  from  1947  1  to  1987:4,  parameters  of  the  ARMA(2,2)  were  chosen  to  match  point  estimate*  of  an  ARMA(2,2)  model 
fitted  to  the  same  data  set  Entries  m  the  column  labelled  "q'  indicate  the  number  of  lags  used  to  compute  the  R/S  statistic; 
a  lag  of  0  corresponds  to  Mandelbrot's  classical  rescaled  range,  and  a  non-integer  lag  value  corresponds  to  the  average  (across 
replications)  lag  value  used  according  to  Andrew's  (1987)  optimal  lag  formula  Standard  errors  for  the  empirical  sire  may  be 
computed  using  the  usual  normal  approximation;  they  are  9.95  x  10~*,  2.18  x  10~^,  and  3.00  x  10"^  for  the  1,  5,  and  10 
percent  tests  respectively. 


I.I.D.  Null  Hypothesis: 


n 

<? 

Mm 

Max 

Mean 

S.  D. 

Sire  1%-Te8t 

Siie  6%-Test 

Size  10%-Te8t 

163 

0 

0522 

2457 

1.167 

0.264 

0.022 

0.081 

0  138 

163 

15 

0.525 

2457 

1.171 

0.253 

0.015 

0.069 

0.121 

163 

1 

0.533 

2.423 

1  170 

0.254 

0016 

0.069 

0.125 

163 

2 

0.564 

2.326 

1.174 

0.246 

0.011 

0.058 

0.111 

163 

3 

0.602 

2.221 

1.179 

0.239 

0.009 

0.046 

0.097 

163 

4 

0.641 

2.136 

1.184 

0.232 

0.006 

0.036 

0.082 

163 

5 

0.645 

2.087 

1.189 

0.225 

0.004 

0.030 

0.071 

163 

6 

0.636 

2.039 

1.193 

0.219 

0.002 

0.024 

0.061 

163 

7 

0.648 

1.989 

1.198 

0.213 

0.000 

0.018 

0.050 

163 

8 

0.657 

1.960 

1.203 

0.207 

0.000 

0.015 

0.040 

ARMA(2,2)  Null  Hypothesis: 


n 

1 

Min 

Max 

Mean 

S.  D. 

Site  1%-Te6t 

Size  5%-Test 

Size  10%-Te8t 

163 

0 

0.746 

3.649 

1.730 

0.396 

0.175 

0.340 

0.442 

163 

6.8 

0.610 

2.200 

1.177 

0.229 

0.009 

0.041 

0.084 

163 

1 

0.626 

3.027 

1.439 

0.821 

0.034 

0.110 

0.182 

163 

2 

0.564 

2.625 

1.273 

0.279 

0.010 

0.054 

0.111 

163 

3 

0.550 

2.412 

1.202 

0.257 

0.012 

0.055 

0.106 

163 

4 

0.569 

2.294 

1.180 

0.244 

0.012 

0.054 

0.102 

163 

5 

0.609 

2.241 

1.178 

0.236 

0.010 

0.048 

0093 

163 

6 

0.616 

2.181 

1.180 

0.229 

0.008 

0.040 

0.082 

163 

7 

0.629 

2.109 

1.180 

0.222 

0.006 

0.034 

0.073 

163 

8 

0.644 

2.035 

1.179 

0.215 

0.005 

0.030 

0.066 
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Table*. 

Power  of  the  modified  R/S  statistic  under  a  Gaussian  fractionally  differenced  alternative  with  differencing  parameters  d  = 
1/3,  —1/3  The  Monte  Carlo  experiments  under  the  two  alternative  hypotheses  are  independent  and  consist  of  10,000  replications 
each  Parameters  of  the  simulations  were  choeen  to  match  the  sample  mean  and  variance  of  quarterly  real  GNP  growth  rates 
from  19471  to  1987:4  Entries  in  the  column  labelled  'q"  indicate  the  number  of  lags  used  to  compute  the  R/S  statistic;  a 
lag  of  0  corresponds  to  Mandelbrot's  classical  rescaled  range,  and  a  non-integer  lag  value  corresponds  to  the  average  (across 
replications)  lag  value  used  according  to  Andrew's  (1987)  optimal  lag  formula. 


1/3: 


n 

9 

Min 

Max 

Mean 

S.  D. 

Power  1%-Test 

Power  6%-Te6t 

Power  10%-Test 

163 

0 

0.824 

4.659 

2.870 

0.612 

0.637 

0.778 

0.839 

163 

6.0 

0,702 

2.513 

1.524 

0.286 

0.017 

0.126 

0.240 

163 

1 

0.751 

S.657 

2.004 

0.478 

0.416 

0.687 

0.680 

163 

2 

0.721 

3.140 

1.811 

0.409 

0.254 

0.448 

0.545 

163 

3 

0.708 

2.820 

1  688 

0.363 

0.141 

0.331 

0.440 

163 

4 

0.696 

2.589 

1.600 

0.330 

0.068 

0.234 

0.350 

163 

5 

0.700 

2  417 

1.534 

0.304 

0.027 

0.158 

0.271 

163 

6 

0.700 

2.297 

1.482 

0.282 

0.008 

0.096 

0.201 

163 

7 

0.699 

2.195 

1.440 

0.264 

0.001 

0.056 

0.141 

163 

8 

0.694 

2.107 

1.405 

0.249 

0.000 

0.027 

0.097 

d   =    -1/3: 


n 

9 

Min 

Max 

Mean 

S.  D. 

Power  1%-Test 

Power  6%-Test 

Power  10%-Te8t 

163 

0 

0.352 

1.080 

0.614 

0.103 

0.849 

0.956 

0.981 

163 

4  1 

0.449 

1.626 

0.838 

0.142 

0.211 

0.456 

0.600 

163 

1 

0.416 

1.251 

0.708 

0.116 

0.687 

0.811 

0.895 

163 

2 

0.456 

1.344 

0.779 

0.125 

0.350 

0.631 

0.758 

163 

3 

0.612 

1.467 

0.837 

0.132 

0.194 

0.458 

0.612 

163 

4 

0.546 

1.545 

0.887 

0.137 

0.100 

0.309 

0.471 

163 

5 

0.564 

1.667 

0.931 

0.141 

0.046 

0.200 

0.334 

163 

6 

0.600 

1.664 

0.970 

0.144 

0.019 

0.124 

0.236 

163 

7 

0.638 

1.731 

1.007 

0.147 

0.008 

0.074 

0.158 

163 

8 

0.652 

1.775 

1.041 

0.149 

0.004 

0.041 

0.103 

2  74 


8.89 
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